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REMOTE AUTHENTICATION BASED ON EXCHANGING 
SIGNALS REPRESENTING BIOMETRICS INFORMATION 

FIELD OF THE INVENTION 
5 The present invention relates to the field of secure communications and in particular 
to techniques of remote authentication of, and secret key establishment between, 
communicating parties to protect or secure communications over insecure channels. 

BACKGROUND 

10 Individuals or computer systems often need to have authenticated and confidential 
communications over an open channels, such as the Internet. While such secure 
communications may be achieved by physical means, it is more cost effective and 
flexible to use cryptographic means. 

15 To have secure communications using cryptographic means, parties need to first 

execute a protocol to authenticate each other and at the same time establish a mutually 
agreed conventional up on secret key, which is then used to encrypt subsequent 
communications between the parties. Conventional authentication and key exchange 
protocols normally require that, the parties either share a secret (e.g., a password) or 

20 know each other's public keys. 

A cryptographic system, or cryptosystem, uses an encryption key to convert plaintext 
into ciphertext (an unintelligible or undecipherable form of the original information) 
and a decryption key to recover the plaintext from ciphertext. If the encryption key 

25 and the decryption key are identical, the cryptosystem is referred to as symmetric key 
cryptosystem. If the encryption and decryption keys are different and it is 
computationally infeasible to determine the decryption key from the encryption key, 
the cryptosystem is referred to as an asymmetric key cryptosystem or public key 
cryptosystem. In a public key cryptosystem, anyone can encrypt a message using a 

30 public encryption key. However, only the holder of a corresponding private 
decryption key can decrypt the ciphertext and recover the message. 
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Li a public key cryptosystem, it is often important to securely bind a public key with 
the legitimate user's ID. Such a binding can be achieved using public key certificates, 
which are digitally signed and issued by a certification authority. 
5 Roughly speaking, a one-way hash function h() has the properties that: 

1) for any message m, the hash h(m) can be easily computed; 

2) given h(m), finding m is computationally infeasible; and 

3) finding two messages that have the same hash is computationally 
infeasible. 

10 For more information on cryptosystems, digital signature schemes, public key 
certificates, and one-way hash functions, reference is made to A. Menezes, P. 
Oorschot, and S. Vanstone, Handbook of Applied Cryptography , CRC Press, pp. 425- 
488, pp. 559-561, and pp. 321-383 1996; and C. Kaufman, R. Perlman, and M. 
Speciner, Network Security - Private Communication in A Public World , PTR 

15 Prentice Hall, Englewood Cliffs, NJ, pp. 152-158, pp. 177-204 and pp. 101-129 1995. 

The UNIX (a trademark of Bell Laboratories) operating system provides a classical 
example of a password based authentication system. In UNIX, each user is provided 
with a unique login user name and is allowed to choose a secret password. The UNIX 

20 system maintains a password file containing the user name and a hash of the user's 
password computed using a one-way hash function with the user's password as input. 
When a UNIX user desires to access the UNIX system, the user keys in his or her user 
name and password to a terminal. The terminal computes the hash of the password 
and sends the hash along with the user name to the UNIX system. Because only the 

25 user knows the password, if the hash and user name match those in the password file, 
the user is considered authenticated. 



The UNIX password system is simple to implement, but has a number of problems. 
Firstly, it is vulnerable to a "replay" attack. That is an eavesdropper can intercept the 
30 user name and the hash of the password, and replay them to the UNIX system. 

Secondly, knowing the hash of a password, an eavesdropper can mount an off-line 




-3- 

dictionary attack. The person can guess a password, compute its hash, and see if the 
two hash values match. The person can then systematically try passwords, one at a 
time, until a match is found. Since people tend to choose easy to remember or "weak 
passwords", such an attack can be very effective. Thirdly, the UNIX system only 
5 authenticates the user, and no secret key is established to encrypt subsequent 
interactions between the user and the system. 



A number of authentication and key establishment protocols have been proposed to 
improve upon the UNIX password protocol. Examples include: 
10 1) R. Needham and M. Schroeder, "Using encryption for 

esc 

g authentication in large networks of computers", Communications of 

f) the ACM, Vol. 21, December 1978, pp. 993-999; 

2) D. Otway and O. Rees, "Efficient and timely authentication", 
Operating Systems Review, Vol. 21, No. 1, January 1987, pp. 8-10; 
15 3) L. Gong, M. Lomas, R. Needham, and J. Saltzer, "Protecting poorly 

chosen secrets from guessing attacks IEEE Journal on Selected 
Areas of Communications, Vol. 11, No. 5, June 1993, pp. 648-656; and 
4) US Patent No. 5,440,635 issued to S. Bellovin and M. Merritt on 
August 8, 1995. 

20 

A number of the conventional authentication protocols require that the parties share 
secret information (such as a password) or possess each other's public keys in 
advance. There are many potential difficulties for a human user to share secrets with a 
large number of remote parties. Firstly, it requires a secure secret distribution 
25 mechanism to be in place. Secondly and more importantly, human users are not good 
at remembering secrets of good quality, since such secrets look like random data. 
Knowing each other's public key in authenticated manners is also problematic in a 
distributed and open environment. 



30 



Without good authentication and encryption, voice-over-IP (the Internet protocol) can 
be eavesdropped without much difficulty. Pretty Good Privacy Phone or PGPfone 
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(both are trademarks of Pretty Good Privacy Inc.) implements an authentication 
protocol based on exchange of voice signals and Diffie-Hellman key exchange 
protocol, P. Zimmermann, PGPfone Owner's Manual, Version 1.0 beta 5, 5 January 
1996, htlT)://web.mit.edu/network/pjgpfone/manuaL 

5 

Before proceeding with a discussion of the PGPfone authentication protocol, the 
Diffie-Hellman key exchange protocol, W. Diffie and M. Hellman, "New directions in 
cryptography", IEEE Transactions on Information Theory, Vol. IT-22, No. 6, pp. 644- 
654, November 1976 is reviewed. Diffie-Hellman key exchange allows two parties, 
10 without sharing keying material in advance, to agree to a secret key over an open 
,q channel, but without authentication. In Diffie-Hellman key exchange, two parties A 

;\J and B agree on an appropriate prime p and a generator of Z* p , where Z* p = {x | 0< x 

< p_i } . Party A generates a random number x, 1 <x<p-l, and then computes and 
sends to Bob g x modulo p. Party B generates a random number y, 1< y < p -1, and 
15 then computes and sends to party A g y modulo p. 

Q Party A computes a shared key k = (g y ) x modulo p, and party B computes k = (g x ) y 

■3 modulo p. The Diffie-Hellman protocol can be carried out in any group in which the 

!sJ discrete logarithm problem is difficult to solve. This protocol, however, is vulnerable 

20 to "man-in-the-middle" attacks. If a party C comes in the middle between parties A 
and B, when party A wishes to have a Diffie-Hellman exchange with party B, party C 
intercepts all the messages from A and B and enters the Diffie-Hellman exchange with 
A and B, respectively. As a result, C agrees a secret key with A and another secret key 
with B so that C can decrypt all the messages from A using the key shared with A and 
25 re-encrypt the messages using the key shared with B. 



The PGPfone authentication protocol assumes that the two parties are familiar with 
each other's voice. The two parties first establish a shared value (e.g., g xy mod p) by 
performing a Diffie-Hellman exchange. The parties next compute the hash of the 
30 shared value. Each party then reads the first few bytes (in hexadecimal format or in 
English words. PGPfone; maintains a list that maps the 256 values of a byte to 256 
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English words) of the hash to each other. If the bytes at the two ends match and if the 
voice sounds like that of the claimed party, the parties are considered authentic. 
However, if an attacker is able to collect sound samples of all the 256 words by, for 
example, eavesdropping on someone's phone calls, the attacker is able to impersonate 
5 the victim at will. 

Thus, a need clearly exists for a method of remote authentication based on exchanging 
signals representing biometrics information and establishing a cryptographic key. 

10 SUMMARY 

In accordance with one aspect of the invention, a method of authenticating a 
remote party and establishing a cryptographic key for secure communications via an 
insecure communications channel. The method includes the steps of: 

generating a first challenge signal of minimum duration T, where T is a fixed 
15 time interval; 

generating a random number x, computing g x modulo p, where g and p are 
numbers, deriving a key k A from g x modulo p, encrypting the first challenge signal 
with k A and a symmetric key cryptosystem, and sending a first ciphertext to the remote 
party; 

20 receiving a second ciphertext from the remote party, sending g x modulo p to 

the remote party, and starting a clock; 

receiving a third ciphertext and g y modulo p from the remote party, stopping 
the clock, and computing an elapsed time interval of the clock; 

deriving a key k B from g y modulo p, computing g xy modulo p, deriving a key 
25 kAB from g xy modulo p, decrypting the second ciphertext with ke to recover a second 
challenge signal from the remote party, decrypting the third ciphertext to recover a 
first response signal from the remote party; 

verifying that the elapsed time of the clock is within a predetermined interval 
(TL A , TU A ), where TL A and TU A are positive numbers; 
30 verifying that the second challenge signal is produced by the remote party; 
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producing a second response signal of minimum duration T, encrypting the 
second response signal with Icab and sending a fourth ciphertext to the remote party; 

verifying that the first response signal is a response produced by the remote 
party to the first challenge signal; and 
5 generating a key k from g xy modulo p for secure communications with the 

remote party. 

Correspondingly, an apparatus and a computer program product based on the 
foregoing method are also disclosed. 

10 

In accordance with another aspect of the invention, there is disclosed a method 
of authenticating a remote party and establishing a cryptographic key for secure 
communications via an insecure communications channel. The method includes the 
steps of: 

1 5 receiving a first ciphertext from the remote party, generating a random number 

y, computing g y modulo p, where g and p are numbers; 

producing a first challenge signal of a minimum duration T, where T is a fixed 
time interval; 

deriving a key k B from g y modulo p, encrypting the first challenge signal with 
20 ke and a symmetric key cryptosystem, and sending a second ciphertext to the remote 
party; 

receiving g x modulo p from the remote party, deriving a key k A from g x 
modulo p, decrypting the first ciphertext to recover a second challenge signal from the 
remote party; 

25 verifying that the second challenge signal is produced by the remote party, 

producing a first response signal of the minimum duration T; 

computing g xy modulo p, deriving a key kAB from g xy modulo p, encrypting the 
first response signal, sending a third ciphertext and g y modulo p to the remote party, 
and starting a clock; 
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receiving a fourth ciphertext, stopping the clock, computing the elapsed time 
of the clock, and decrypting the fourth ciphertext to recover a second response signal 
from the remote party; 

verifying that the elapsed time of the clock is within a predetermined interval 
5 (TLb, TUb), where TL B and TUb are positive numbers; 

verifying that the second response signal is a response produced by the remote 
party to the first challenge signal; and 

generating a key k from g xy modulo p for secure communications with the 
remote party. 

10 

Correspondingly, an apparatus and a computer program product based on the 
foregoing method are also disclosed. 

In accordance with yet another aspect of the invention, there is disclosed a 
15 method of authenticating a remote party and establishing a cryptographic key for 
secure communications via an insecure communications channel. The method 
includes the steps of: 

generating a first challenge signal of minimum duration T, where T is a fixed 
time interval; 

20 generating a random number x, computing g x modulo p, where g and p are 

numbers, deriving a key k A from g x modulo p, encrypting the first challenge signal 
with kA and a symmetric key cryptosystem, and sending a first ciphertext to the remote 
party; 

receiving a second ciphertext, sending g x modulo p to the remote party, and 
25 starting a clock; 

receiving g y modulo p, computing a key k B from g y modulo p, decrypting the 
second ciphertext to recover a second challenge signal from the remote party; 

verifying the second challenge statement to ensure that the second challenge 
statement is produced by the remote party, and producing a first response signal of 
30 minimum duration T; 
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computing g xy modulo p, deriving a key Icab from g xy modulo p, encrypting the 
first response signal and sending a third ciphertext to the remote party; 

receiving a fourth ciphertext from the remote party, stopping the clock, 
decrypting the fourth ciphertext with kAB to recover a second response signal from the 
5 remote party; 

verifying that the elapsed time of the clock is within a predetermined interval 
(tl A , tu A ), where tl A and tu A are positive numbers; 

verifying that the second response signal is a response produced by the remote 
party to the first challenge signal; and 
10 generating a key k from g xy modulo p for secure communications with the remote 
party. 

Correspondingly, an apparatus and a computer program product based on the 
foregoing method are also disclosed. 

15 

In accordance with a further aspect of the invention, there is disclosed a 
method of authenticating a remote party and establishing a cryptographic key for 
secure communications via an insecure communications channel. The method 
includes the steps of: 

20 receiving a first ciphertext from remote party, generating a random number y, 

and computing g y modulo p, where g and p are numbers; 

producing a first challenge signal of minimum duration T, where T is a fixed 
time interval; 

deriving a key k B from g y modulo p, encrypting the first challenge signal with 
25 k B and a symmetric key cryptosystem, and sending a second ciphertext; 

receiving g x modulo p, computing a key k A from g x modulo p, decrypting the 
first ciphertext to recover a second challenge signal from remote party, sending g y to 
remote party and starting a clock; 

verifying the second challenge statement to make sure that the second 
30 challenge statement is produced by the remote party, and then producing a first 
response signal of minimum duration T; 



computing g xy modulo p, deriving a key Icab from g xy modulo p, encrypting the 
first response signal and sending a third ciphertext to the remote party; 

receiving a fourth ciphertext from the remote party, stopping the clock, 
decrypting the fourth ciphertext with kAB to recover a second response signal from the 
remote party; 

verifying that the elapsed time of the clock is within an interval (tl B , tu B ), 
where tie and tu B are positive numbers; 

verifying that the second response signal a response produced by the remote 
party to the first challenge signal; and 

generating a key k from g xy modulo p for secure communications with the 
remote party. 

Correspondingly, an apparatus and a computer program product based on the 
foregoing method are also disclosed. 

BRIEF DESCRIPTION OF THE DRAWINGS 

A small number of embodiments of the invention are described hereinafter with 
reference to the drawings, in which: 

Fig. 1 is a block diagram illustrating a communication system model between two 
remote individuals over an open transmission channel; 

Fig. 2 is a flowchart depicting the operation of a first embodiment of the invention; 
Fig. 3 is a flowchart showing the operation of a second embodiment of the invention; 
Fig. 4 is a flowchart depicting the first scenario of a man-in-the-middle attack in the 
first embodiment of Fig. 2; 

Fig. 5 is a flowchart showing the second scenario of a man-in-the-middle attack in the 
first embodiment of the invention of Fig. 2; 

Fig. 6 is a block diagram illustrating a communication device in accordance with the 
first embodiment of the invention; and 

Fig. 7 is a block diagram of a general-purpose computer with which the embodiments 
of the invention can be practised. 
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DETAILED DESCRIPTION 

A method, an apparatus, and a computer program product for remote authentication 
based on exchanging signals representing biometrics information and establishing a 
cryptographic key are described. In the following description, numerous details are 
set forth including communications channels for example. It will be apparent to one 
skilled in the art, however, that the present invention may be practised without these 
specific details. In other instances, well-known features are not described in detail so 
as not to obscure the present invention. 



The detailed description is organised as follows: 

1 . Notation and Definitions 

2. Block Diagram of Communications Device 

3. First Embodiment 
15 4. Second Embodiment 

5 . Security Considerations 

6. Computer Implementation 



In the following description, components of the system are described as modules. A 
20 module, and in particular its functionality, can be implemented in either hardware or 
software. In the software sense, a module is a process, program, or portion thereof, 
that usually performs a particular function or related functions. In the hardware sense, 
a module is a functional hardware unit designed for use with other components or 
modules. For example, a module may be implemented using discrete electronic 
25 components, or it can form a portion of an entire electronic circuit such as an 

Application Specific Integrated Circuit (ASIC). Numerous other possibilities exist. 
Those skilled in the art will appreciate that the system can also be implemented as a 
combination of hardware and software modules. 



30 1 . Notation and Definitions 



The following notation is used throughout: 
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A, B : Parties (Alice and Bob, respectively) that seek to have 

secure communications; 
e(k, m) : Encrypted message containing original message m and 

a key k using a symmetric key cryptosystem; 
5 d(k, c) : Decrypted message containing a ciphertext c and the 

key k using a symmetric key cryptosystem; 
Cx : An acoustic wave or digital representation of a 

challenge statement spoken by a party X (either A or 

B); whether Cx is an acoustic wave or a digital 
10 representation should be clear to those skilled in the art 

from the discussion context; 
R Y : An acoustic wave or digital representation of a response 

statement spoken by a party Y in reply to Cx; whether 

R Y is an acoustic wave or a digital representation 
1 5 should be clear to those skilled in the art from the 

discussion context; 
|Cx| : the time duration of C x ; 

|Ry| : the time duration of Ry; and 

T : A required minimum time duration of any statement 

20 spoken by a party. 

In the following description, "secure communications" means communications that 
are authenticated or confidential. 



Fig. 1 illustrates a general model 100 of communications between two remote 
25 individuals. In this model 100, Alice 110 and Bob 150 are two individuals, who are 
familiar with each other's biometrics characteristics (without loss of generality, voice 
signals are used hereinafter) and wish to have a secure communication. The 
transmission channel 130 represents the means, and more specifically the media, 
through which communication messages are exchanged between the communication 
30 devices 120, 140. The transmission channel 130 includes, but is not limited to, any 
communications means or media such as computer networks, public telephone 
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switching networks, and radio links. Alice 110 and Bob 150 communicate with each 
other by interfacing through communication devices (A) 120 and (B) 140, 
respectively. Devices (A, B) 120, 140 have appropriate speech signal processing 
capabilities (such as speech encoding and decoding). The devices 120, 140 accept 
5 audio input, either directly or indirectly, from Alice 110 and Bob 140, respectively, 
and output the other party's received audio signal. 

In the embodiments of the invention, devices (A, B) 120, 140 each exchange signals 
using a Diffie-Hellman key exchange system and a symmetric key cryptosystem. For 
^ 10 the purposes of illustration only, both devices (A, B) 120, 140 are assumed to have 

D access to a common set of Diffie-Hellman parameters, g and p, which may be 

H 

distributed either off-line or on-line. 

^ The minimum duration of any statement spoken by a party 110, 150 is required to be 

1 5 T, where T is either pre-fixed or agreed upon by Alice 110 and Bob 150. For security 
7i reasons, T should be much longer than the channel round-trip delay and the processing 

^ delay of each device 120, 140. For example, T may be in the range of tens of seconds 

□ to several minutes, while channel round-trip delay and processing delays are normally 

^ less than one second. To keep notation compact, only residues modulo are used 

20 hereinafter: that is, g x modulo p, g y modulo p, and g xy modulo p are written simply as 
g x , g y , and g xy , respectively. In addition, while not mentioned explicitly, devices (A 
and B) 110, 150 are assumed to perform audio encoding/decoding and audio 
compression/decompression operations, whenever necessary. 

25 In the embodiments of the invention, the communicating parties are assumed to be 
familiar with each other's voice (biometrics characteristics in general) and able to 
recognise each other by listening to each other's speech. This is a reasonable 
requirement, since there are generally no confidential topics between two strangers 
without the involvement of a trusted third party. In the embodiments, party 110 

30 (Alice) starts a session with party 150 (Bob) by speaking a challenge statement such 



as: 
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"This is Alice! The time is 21 minutes passed 9AM. How was your 
mid-term examination, Bob?". 



Upon hearing Alice's message, party 150 (Bob) is expected to speak a response 
5 statement like: 

"Hi, Alice! Bob's here. My mid-term exam was not very good. But 
thank God, it is over!". 

In the embodiments, Alice is assumed to be able to distinguish whether the stated 
10 response is in Bob's voice and whether the stated response corresponds to her 

challenge statement in the first place. Furthermore, it is assumed that it is difficult for 
an attacker to mimic a target party's voice to produce a meaningful response statement 
in real-time. 

15 2. Block Diagram of Communications Device 

Fig. 6 is a block diagram of a communication device 600 in accordance with the 
embodiments of the invention. The communications device 600 includes a speech 
encoding/decoding module 632, a control module 636, an encryption/decryption 
module 640, a key generator 650, a Diffie Hellman key exchange system 660, a timer 

20 module 670, an input/output (I/O) module 680 for transmitting and receiving data via 
the communications channel 610, and a memory 682. Optionally, the device 600 also 
includes a transducer module 630 and an audio compression/decompression module 



25 The control module 636 controls the components of the device 600 via signals 690A - 
6901. The transducer module 630 converts audio 620 into audio signals and vice 
versa. Operation of the transducer module 630 can be controlled by the control 
module 636 via a control signal(s) 690A. In turn, the transducer 630 is coupled 
bidirectionally with the speech encoding/decoding module 632. This module 632 

30 speech encodes and decodes data and is controlled by the control module 636 via the 
signal(s) 690B. Optionally, the speech encoding/decoding module 632 is coupled 



634. 
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bidirectionally to an audio compression/decompression module 634. Otherwise, the 
module 632 can be directly coupled to an encryption/decryption module 640. The 
control module 636 is coupled to the audio compression/decompression module 634 
via the control signal 690C. 

5 

The encryption/decryption module 640 is optionally coupled to the audio 
compression/decompression module 634 and is coupled to the control module 636 via 
a control signal(s) 690D. The encryption/decryption module 640 is also coupled to 
memory 682, Diffie Hellman key exchange system 660, timer module 670, and the I/O 
^ 10 module 680 by bus 692. The key generator 650 is coupled between the Diffie 

0 Hellman key exchange system 660 and the encryption/decryption module 640. The 

jag, key generator 650 produces a key k from a code G and is coupled to the control 

'% module 636 via a control signal(s) 690E. The memory 682 is coupled to the control 

if; module 636 via a control signal(s) 690G. 

T 15 

The Diffie Hellman key exchange system 660 has as inputs code G, prime number P, 
j«* random value X, and a control signal(s) 690F from control module 636. The timer 

q module 670 has an interrupt (INT) output to the I/O module 680 that allows the timer 

^ module 670 to interrupt transmission/reception of data via the I/O module 680. The 

20 timer module 670 is coupled to the control module 636 via a control signal 6901. The 

I/O module 680 is coupled to the control module 636 by control signal(s) 690H. 

Operation of the communication device 600 is described hereinafter with reference to 

the first and second embodiments. 



25 The embodiments of the invention advantageously employ mechanisms that enable 
users to authenticate each other and have secure communications over open or 
insecure channels by using a different technique. More particularly, users are not 
required to share or remember any secret key or password, or possess each other's 
public keys in advance. Authentication and key establishment are achieved by 

30 exchanging signals representing a remote party's biometrics information. In this 
connection, the embodiments of the invention utilize a cryptographic one-way hash 
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function. The embodiments concentrate on authenticating remote human users based 
on the interaction of signals representing a remote party's biometrics information 
(such as acoustics waves). Parties are assumed to be able to identify each other based 
on the exchanged biometrics signals. There is no need for the parties to share any 
5 secret key, or know each other's public key in advance. The embodiments of the 
invention can be advantageously employed in applications such as Internet 
telephoning or voice-over-IP (Internet Protocol). 

3. First Embodiment 

10 Fig. 2 is a flowchart illustrating the method of secure communications according to 
the first embodiment of the invention. The flowchart is organised in columns in the 
following order: Alice 110, device (A) 120, device (B) 140, and Bob 150. In step 
210, Alice 110 speaks a challenge statement C A , which is input to Device A (shown as 
device 600 in Fig. 6). Preferably, the challenge statement Ca contains some 

15 "freshness" elements, such as the date and time, and news headlines of the day. In 
step 212, Device A generates a random number x and computes g x . Device A then 
preferably computes a key k A from code g x using the key generator. Next, Device A 
encrypts the challenge statement C A with key k A using the key generator and sends the 
ciphertext, referred to as Message Al: 

20 e(k A , C A ), 

preferably together with Alice's identity to Device B over the transmission channel 
610. 

In step 214, Device B receives the Message Al and prompts Bob to speak a challenge 
25 statement Cb in step 216. In step 218, Device B generates a random number y, 
computes g y and preferably a key k B from the code g y for a symmetric key 
cryptosystem. Device B then encrypts Cb and transmits the ciphertext, referred to as 
Message A2: 

e(k B , C B ), 

30 preferably together with Bob's identity to Device A. 
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In step 220, Device A receives the Message A2. In step 222, Device A sends to 
Device B the code, referred to as Message A3: 

g x , 

and starts a clock or timer to measure the time interval for a response. In step 224, 
5 after receiving code g x , Device B computes the key k A from the code g x and decrypts 
the ciphertext received in Message Al to recover the challenge statement Ca- Device 
B then plays back the challenge statement Ca to Bob. In step 226, Bob listens to the 
challenge statement C A and verifies that the voice belongs to Alice. If the verification 
fails, Bob terminates the session. Otherwise, in step 228, Bob speaks a response 
10 statement Rb in reply to the challenge statement Ca (e.g., by iterating Ca in his own 
voice). 

In step 230, Device B computes the code (g x ) y = g xy and a key kAB from g xy in well- 
known fashion for the symmetric key cryptosystem, and encrypts R B with kAB and the 
15 symmetric key cryptosystem to obtain e(kAB, Rb). In step 232, Device B sends, as 
Message A4, the following: 

g y , e(kAB> Rb), 
to Device A and then starts its clock. 

20 In step 234, upon receipt of Message A4, Device A first stops its clock started in step 
222. Time t A is the elapsed time of the clock. Device A then computes the code (g y ) x 
= g 5 *, the key kAB from code g**, and the key k B from the code g y . Device A then 
decrypts e(kB, Cb) with the key ks and e(kAB, Rb) with key kAB to recover the 
challenge statement Cb and the response Rb, respectively. |Ca| is the duration of the 

25 challenge statement Ca, and |R B | is the duration of the response R B . It will be readily 
apparent to those skilled in the art how to obtain the duration of an audio signal. Let 
TL A = |Ca| + |Rb|. Further, in step 230, Device A checks the elapsed time t A to see if: 



where TUa can be taken advantageously as T + TL A if the channel round trip delay 
30 and processing delay in Devices A and B 120, 140 are negligible compared with T. 

Such delays can be easily incorporated into Equation (1) by those skilled in the art. If 



t A e (TL A , TU a ), 



(1) 
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10 



■ 15 



20 



t A e (TL A , TU a ) is not true, Device A terminates the session. Otherwise, in step 236, 
Alice listens and verifies the challenge statement Cb. 

If Alice recognises that the challenge statement Cb is not Bob's voice, Alice 
terminates the session. Otherwise, in step 238, Alice speaks a response statement R A 
(e.g., by iterating Cb in her own voice) in reply to the challenge statement Cb. hi step 
240, Device A encrypts the response statement R A with the key k A B and sends to 
Device B the ciphertext, as Message A5: 
eCkAB, R A ). 

In step 248, following step 238, Alice listens and verifies the response statement R B . 
If the response statement Rb is not either a response to the challenge statement C A or 
in Bob's voice, Alice stops the session. If Alice is sure that the response statement Rb 
is a reply to the challenge statement C A in Bob's voice, she begins communicating in 
step 250. 

Next, from step 240 in step 242, upon receipt of Message A5, Device B stops its clock 
or timer. The time t B is the elapsed time of the clock. Also, Device B decrypts e(k AB , 
R A ) with the key kAB to obtain the response statement R A . Let |Cb| denotes the 
playback duration of C B and |R A | the playback duration of R A . Let TL B = |Cb| + |R A |. 
In step 242, Device B checks the elapsed time to see if: 



where TUb can be taken advantageously as T + TLb if the channel round trip delay 
and processing delay at Devices A and B are negligible in comparison with T. If 
Equation (2) is not satisfied, Device B terminates the session. Otherwise, Device B 
plays back the response statement R A to Bob. In step 244, Bob listens and verifies the 
response statement R A . If Bob recognises that R A is a reply to the challenge statement 
Cb in Alice's voice, Bob can be confident that he is communicating with Alice and he 
can proceed to step 246. Otherwise, Bob stops the session. 



t B e (TL B , TU b ), 



(2) 



30 



In steps 250 and 246, respectively, Bob and Alice preferably communicate with each 
other. In step 252, Device A encrypts messages from Alice and decrypts messages 
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from Bob with a key derived from g xy , preferably using another symmetric key 
cryptosystem. Similarly, in step 256, Device B likewise encrypts messages from Bob 
and decrypts message from Alice with a key derived from g xy in the same way as 
Device A and with the same symmetric key cryptosystem as used by Device A. 

5 

4. Second Embodiment 

Fig. 3 is a flowchart illustrating a method of secure communications according to the 
second embodiment of the invention. In step 310, Alice starts the session by speaking 
a challenge statement C A . In step 312, Device A generates a random value x, 
10 computes a code g x and a key k A from g x for a symmetric key cryptosystem, encrypts 
the challenge statement C A with k A and sends to B the ciphertext, as Message Bl : 
e(k A , C A ). 

In step 314, Device B receives Message Bl. Next, in step 316, Device B prompts Bob 
15 to speak a challenge statement Cb in step 318. In step 316, Device B then generates a 
random number y, computes code g y and a key k B from g y for a symmetric key 
cryptosystem, encrypts Cb with k B and sends to Alice the ciphertext, as message B2: 
e(k B , C B ). 

20 In step 320, Device A receives Message B2. Next, in step 322, Device A sends to 
Bob, as Message B3: 

g x , 

and starts a clock. 

25 In step 324, upon receipt of the Message B3, Device B computes a key k A from code 
g\ decrypts e(k A , C A ) to recover C A , sends to Alice as Message B4: 

g y , 

and starts a clock. Device B then outputs the challenge statement C A to Bob. In step 
326, Bob listens and verifies whether the challenge statement C A is in Alice's voice. 
30 Bob terminates the process if he has doubts on the originality of the challenge 

statement C A . When the challenge statement C A is verified successfully by Bob, in 
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step 328, Bob speaks a response statement Rb in reply to the challenge statement Ca- 
In step 330, Device B computes code g xy and a key kAB from g xy , encrypts Rb and 
sends to Alice the ciphertext, as Message B5 : 
e(kAB, Rb). 

On the other hand, Message B4 is received by Device A in step 332. Device A 
computes ks from g y , decrypts e(ks, Cb) to recover Cb. Alice listens and verifies Cb in 
step 334. Alice stops the process if she believes that Cb is not in Bob's voice; 
otherwise, she speaks a response statement R A in reply to Cb in step 336. In step 338, 
Device A computes g** and kAB from g**, encrypts Ra and sends to Bob the ciphertext, 
as Message B6: 
e(kAB, Ra). 

In step 340, Device A receives Message B5. Device A also stops the clock, decrypts 
e(kAB> Rb) to recover the response statement Rb and checks to see if the elapsed time 
of the clock T A satisfies the following: 



where preferably tl A = |Ca| + |Rb| and tu A = tl A + T. Device A terminates the session 
if Equation (3) is not satisfied. Otherwise, Device A outputs the response statement 
R B to Alice. In step 342, Alice listens and verifies the response statement R B . Alice 
stops the session if she is not convinced that the response statement Rb is Bob's 
response to the challenge statement Ca- Otherwise, in step 344, Alice starts 
communications with Bob. 

In step 348, upon receipt of the Message B6, Device B stops its clock, decrypts e(kAB> 
Ra) to recover R A . Device B, then checks to see if the elapsed time of its clock 
satisfies the following: 



where preferably tls = |Cb| + |Ra| and Uib = tl B + T. Device B terminates the session if 
Equation (4) is not satisfied. Otherwise, Device B outputs the response statement R A 
to Bob. In step 350, Bob listens and verifies Ra. Bob stops the session if he is not 



T A e (tl A , tu A ), 



(3) 



T B e (tl B , tu B ) 



(4) 
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convinced that the response statement Ra is Alice's response to Cb. Otherwise, in 
step 352, he starts communications with Alice. 

With successful authentication of both Alice and Bob, Devices A and B, derive a key 
5 from g xy and use the key to encrypt and decrypt messages between Alice and Bob in 
steps 346 and 354, respectively, using communications obtained in steps 344 and 352. 

5. Security Considerations 

Symmetric key cryptosystems are used in the embodiments to encrypt challenge 
10 signals and response signals. It is important that all encryptions resist data 
modification (such as cut and paste) attacks. 

The symmetric key cryptosystem used to encrypt challenge and response signals can 
be replaced by a cryptographic commitment function. Such a commitment function 
1 5 has the following properties: 



One way to form a commitment function is using a cryptographic one-way hash 
function h(). To commit to an item I, the committing party computes the commitment 
h(k || I), where k is a secret key and k || I is the concatenation of k and I. To verify the 
commitment, the verifying party must have k and I, compute h (k || I) and compare h 
25 (k || I) with the commitment. 

Checking the lengths of the elapsed time intervals of the clocks in both illustrative 
embodiments is very important in detecting any man-in-the-middle attacks. This is 
illustrated using the two attacking scenarios respectively, in relation to the first 
30 embodiment of Fig. 2. A similar analysis can be done for the second embodiment. In 
the following descriptions, the term "Alice" is used to refer to both the user Alice and 



2) 



no one can modify the contents of the commitment without being 
detected; and 

no one can get any information about its contents unless the 
committing party discloses the information. 



20 
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Device A. Similarly, the term "Bob" is used to denote both the user Bob and Device 
B. 

Fig. 4 depicts a scenario where Alice attempts to set up a communications session 
5 with Bob and where Clark performs a man-in-the-middle attempt to impersonate Bob 
to Alice. The flow diagram is accordingly organised into three columns: Alice, Clark 
and Bob. In step 410, Alice starts a session by generating a random number x, 
computing g x , speaking a challenge statement C A , computing k A from g x , encrypting 
Ca with k A , and sending the ciphertext e(k A , C A ) to Bob. 

10 

D In step 412, the ciphertext is intercepted by Clark. Clark generates a number z, 

^ computes g z and a key kc from g z , encrypts an old challenge statement C' B from Bob, 

2 and sends the ciphertext e(kc, C' B ) to Alice. In step 414, Alice receives the 

f; ciphertext, replies with g x and starts a clock. The code g x is again intercepted by Clark 

15 at step 416. In step 416, Clark derives the key k A from g x and decrypts e(k A , C A ) to 
™1 recover C A . Clark cannot mimic Bob's voice to produce a meaningful response 

3 statement R B , so Clark impersonates Alice and starts a new session with Bob by 
□ sending Bob e(kc, C A ). 

20 In step 418, upon receipt of e(kc, C A ), Bob generates a random value y, and computes 
code g y and a key k B . Bob then speaks a challenge statement C B , encrypts the 
challenge statement using key k B , and sends the ciphertext e(k B , Cb) to Alice. Clark 
intercepts the ciphertext in step 420. To continue impersonating Alice, Clark sends g z 
to Bob. In step 422, Bob computes key kc from g z , decrypts e(kc, C A ), and listens C A , 

25 which was indeed spoken by Alice. Bob speaks a response statement R B , computes a 
key k B c from g 2 *, encrypts R B with k B c, and transmits g y and the ciphertext e(k BC , Rb). 
In step 424, Clark again intercepts the ciphertext, computes the key k B c from g 72 , and 
decrypts e(k BC , R B ). Now Clark gets R B . Clark then computes k A c from g 512 , encrypts 
R B with k A c, and sends the ciphertext e(k A c, R B ) to Alice. 

30 
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In step 426, Alice stops the clock, decrypts e(kc, C'b) and e(k AC , Rb) to recover C'b 
and Rb, respectively. Alice also listens and verifies that C'b is in Bob's voice. Alice 
then speaks and encrypts a response statement R A , and sends the ciphertext e(k A c> Ra). 



10 



1 = 5 



Next, in step 428, Alice listens and verifies the response statement R B . Since Rb is 
indeed a response to Ca from Bob, Alice is fooled into believing Clark is Bob. 
However, the embodiments of the invention prevent this from happening by checking 
the clock's elapsed time t A against the interval (TL A , TU A ) per Equation (1), where 
TL A = |C A | + |Rb|, and TU A = TL A + T. 



To appreciate the rationale behind Equation (1), without the man-in-the-middle attack 
by Clark, t A = |C A | + |Rb| + A Ai , where A A i is the delay due to processing, 
transmission, and a pause interval introduced by Bob after listening to the challenge 
statement C A , but before speaking the response statement Rb. However, with the man- 
15 in-the-middle attack shown in Fig. 4, t A = |C A | + |Rb| + |Cb| + Aa2, where Aa2 is a delay 
similar to A A i. Since it is required that T » A A i and Aa2 and that |Cb| ^ T. It can be 
readily appreciated that t A = |C A | + |Rb| + |Cb| + A^ > TL A + T = TU A with the attack 
and that t A = |C A | + |Rb| + A A i » TL A < TU A without the attack. Therefore, by 
checking t A against Equation (1), the man-in-the-middle attacked can be detected. 

20 

Fig. 5 shows a second scenario 500 of a man-in-the-middle attack, where Clark 
impersonates Alice to Bob. Again the flow diagram is organised in columns: Alice, 
Clark, and Bob. In step 510, Clark generates z, computes g z and a key kc from g z , and 
encrypts C' A - an old statement from Alice. Clark starts the impersonation by sending 
25 the ciphertext e(k c , C' A ) to Bob. 

In step 512, upon receipt of the message from Clark, Bob generates y, g y , and a key k B 
from g y . Bob then speaks a challenge statement Cb, encrypts the challenge statement 
with k B , and transmits the ciphertext e(k B , Cb) to Alice. In step 514, the ciphertext is 
30 intercepted by Clark and Clark sends g z to Bob. In step 516, Bob derives the key kc 
from g z , and decrypts e(kc, C' A ) with kc to recover C' A . Bob listens to the challenge 
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statement C\ and believes that C'a was indeed spoken by Alice. Bob then speaks a 
response statement R B , derives a key k B c from g 2 *, encrypts R B , transmits the 
ciphertext e(k B c ? Rb) and g y and starts a clock. 

Next, in step 518, upon interception of the ciphertext e(k B c, Rb) and g y , Clark derives 
k B from g y and k B c from g 2 ^, and decrypts e(k B , C B ) to recover C B . Since Clark is not 
able to reply to the challenge statement C B in Alice's voice, Clark encrypts the 
challenge C B with kc and starts a session with Alice by sending e(kc, C B ) to Alice. In 
step 520, upon receipt of e(kc, C B ), Alice generates x, g x , and a key kA from g x . Alice 
then speaks a challenge statement Ca, encrypts the challenge statement with k A , and 
sends the ciphertext e(k A , Ca). In step 522, Clark intercepts the ciphertext and sends 
g z to Alice. In step 524, Alice derives a key k A c from g zx , and decrypts e(kc, C B ) to 
recover C B . Alice then listens to the challenge statement C B and believes that she 
hears Bob's voice. Alice then speaks a response statement R A , encrypts the response 
statement with k A c and sends g x and the ciphertext e(k A c> Ra). In step 526, Clark 
intercepts the message from Alice and decrypts the ciphertext to obtain the response 
R A . Clark then encrypts the response statement with k B c and sends the ciphertext 
e(k BC , R A ) to Bob. 

20 In step 528, Bob receives e(k BC , Ra), stops the clock, and decrypts the ciphertext to 
recover R A . Without checking the elapsed time of the clock, t B , Bob can be misled 
into believing that Bob is communication with Alice since R A is Alice's reply to the 
challenge statement C B . This attack can be foiled easily by checking t B against the 
interval (TL B , TU B ) (see Equation 2), where TL B = |C B | + |R A | and TU B = TL B + T. 

25 Without the man-in-the-middle attack by Clark, t B = |C B | + |Ra| + A B i, where A Bi is the 
delay due to processing, transmission, and a pause interval introduced by Alice after 
listening to C B but before speaking the response statement R A . With the man-in-the- 
middle attack of Fig. 5, t B = |C B | + Ra| + |C A | + A B2 , where A B2 is a delay similar to 
A B i . It is required that T » A B i and A B2 and that |C A | ^ T. Then it can be seen that t B = 

30 |C B | + R A | + |C A | + A B2 < TL B + T = TU B with the attack and t B = |C B | + Ra| + A B i « 
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TLb < TUb without the attack. Therefore, by checking t B against Equation (2), the 
man-in-the-middle attack can be detected. 

6; Computer Implementation 

5 The embodiments of the invention are preferably implemented using a computer(s), 
such as the general-purpose computer shown in Fig. 7. In particular, the processes of 
Figs. 2 and 3 can be implemented as software, or a computer program, executing on 
the computer, where each communication device 120, 140, 600 of Figs. 1 and 6 can be 
implemented using a general purpose computer. The method or process steps for 

10 remote authentication based on exchanging signals representing biometrics 

information are effected by instructions in the software that are carried out by the 
computer. The software may be implemented as one or more modules for 
implementing the process steps. A module is a part of a computer program that 
usually performs a particular function or related functions. Also, as described 

1 5 hereinbefore, a module can also be a packaged functional hardware unit for use with 
other components or modules. 

In particular, the software may be stored in a computer readable medium, including 
the storage devices described below. The software is preferably loaded into the 

20 computer from the computer readable medium and then carried out by the computer. 
A computer program product includes a computer readable medium having such 
software or a computer program recorded on it that can be carried out by a computer. 
The use of the computer program product in the computer preferably effects an 
advantageous apparatus for remote authentication based on exchanging signals 

25 representing biometrics information in accordance with the embodiments of the 
invention. 



The computer system 700 consists of the computer 702, a video display 716, and input 
devices 718, 720. In addition, the computer system 700 can have any of a number of 
30 other output devices including line printers, laser printers, plotters, and other 

reproduction devices connected to the computer 702. The computer system 700 can 
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be connected to one or more other computers via a communication interface 708b 
using an appropriate communication channel 730 such as a modem communications 
path, a computer network, or the like. The computer network may include a local area 
network (LAN), a wide area network (WAN), an Intranet, and/or the Internet. 

5 

The computer 702 itself consists of a central processing unit(s) (simply referred to as a 
processor hereinafter) 704, a memory 706 which may include random access memory 
(RAM) and read-only memory (ROM), input/output (IO) interfaces 708 A, & 708B a 
video interface 710, and one or more storage devices generally represented by a block 

10 712 in Fig. 8. The storage device(s) 712 can consist of one or more of the following: 
a floppy disc, a hard disc drive, a magneto-optical disc drive, CD-ROM, magnetic 
tape or any other of a number of non- volatile storage devices well known to those 
skilled in the art. Each of the components 704 to 712 is typically connected to one or 
more of the other devices via a bus 714 that in turn can consist of data, address, and 

15 control buses. 



The video interface 710 is connected to the video display 716 and provides video 
signals from the computer 702 for display on the video display 716. User input to 
operate the computer 702 can be provided by one or more input devices 708B. For 
20 example, an operator can use the keyboard 718 and/or a pointing device such as the 
mouse 720 to provide input to the computer 702. 



The system 700 is simply provided for illustrative purposes and other configurations 
can be employed without departing from the scope and spirit of the invention. 

25 Computers with which the embodiment can be practiced include IBM-PC/ATs or 
compatibles, one of the Macintosh ™ family of PCs, Sun Sparcstation ™, a 
workstation or the like. The foregoing are merely exemplary of the types of 
computers with which the embodiments of the invention may be practiced. Typically, 
the processes of the embodiments, described hereinafter, are resident as software or a 

30 program recorded on a hard disk drive (generally depicted as block 712 in Fig. 7) as 
the computer readable medium, and read and controlled using the processor 704. 
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Intermediate storage of the program and pixel data and any data fetched from the 
network may be accomplished using the semiconductor memory 706, possibly in 
concert with the hard disk drive 712. 

5 In some instances, the program may be supplied to the user encoded on a CD-ROM or 
a floppy disk (both generally depicted by block 712), or alternatively could be read by 
the user from the network via a modem device connected to the computer, for 
example. Still further, the software can also be loaded into the computer system 700 
from other computer readable medium including magnetic tape, a ROM or integrated 

10 circuit, a magneto-optical disk, a radio or infra-red transmission channel between the 
computer and another device, a computer readable card such as a PCMCIA card, and 
the Internet and Intranets including email transmissions and information recorded on 
websites and the like. The foregoing are merely exemplary of relevant computer 
readable mediums. Other computer readable mediums may be practiced without 

15 departing from the scope and spirit of the invention. 

Thus, a method, an apparatus, and a computer program product for remote 
authentication based on exchanging signals representing biometrics information and 
establishing a cryptographic key have been described. While only a small number of 
20 embodiments are described, it will be apparent to those skilled in the art, in view of 
this disclosure, that numerous changes and/or modifications can be made without 
departing from the scope and spirit of the invention. 



